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METHOD AND SYSTEM FOR STORING LARGE DATA FILES 

Background of the Invention 

Field of the Invention 

The present invention relates to a system and method for storing large data files 
on a computer or computer network. More specifically, the present invention is related 
to the formatting of large data files to promote efficient data storage and transmission. 
Statement of the Prior Art 

Data is typically maintained for storage and retrieval in computer file systems, 
wherein a file comprises a collection of data or a set of records. A file system provides a 
collection of files and file management structures on a physical or logical storage device 
such as a disk or other memory device. A file system stores data in files, and provides 
an application programming interface (API) to facilitate accessing data stored on a 
storage medium such as a disk or other memory medium. A file system API provides 
various functions that are invoked by an application program to access data. Application 
programs control the internal format of a file and determine which data to store in which 
files. A file system typically allows files to be grouped into directories. Each directory 
may contain many files and sub-directories. A file system that groups files into 
directories and sub-directories is referred to as a hierarchical file system. 

There is a continuing need to improve the access and control over file systems 
storing large quantities of variable-sized data records used in a large variety of 
applications. Applications involving accessing and controlling large quantities of stored 
data are found in the public sector, E-commerce, financial/insurance industry, travel 
industry, publishing industry, graphic arts industry, advertising industry and any other 
industry which requires managing large data files. 

Examples where large amounts of data are stored in files in a hierarchical file 
system include database, logistics, and enterprise solutions software used by the 
financial, health and distribution industries, among others. Database, logistics and 
enterprise solutions software include an API to access large quantities of data. 

In another example, computer-aided design (CAD) drawings prepared by 
architects, engineers, designers, planners, and the like require large amounts of data to 
be stored in files in a hierarchical file system. CAD software includes an API to access 
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the large quantities of data. Applications such as, e.g., MicroStation® products, which 
are developed by Bentley Systems, Inc., Exton, Pennsylvania U.S.A., and AutoCAD® 
products, which are developed by Autodesk, Inc., San Rafael, California, U.S.A. are 
typical of such CAD software, which may be used in the Engineering, Construction, and 
5 Operations (ECO) marketplace. U.S. Patent Number 6,063,128 provides an example of 
a CAD system. 

A typical CAD project employed in the engineering context is stored in numerous 
files. Each file typically contains one or more engineering models, each of which 
represents an engineering domain (e.g., structural, electrical, mechanical, plumbing). 
10 Moreover, each engineering model requires numerous items represented by a series of 
elements to support the complex and precise nature of each design. In this context, the 
term "element" is used to mean a record containing a variable number of bytes of data 
B arranged in a format that can be interpreted by a program. The term "element" differs 
^ from the common notion of an "object" in that each element can have a variable number 
r§5 of bytes, whereas the size of an object is typically defined by It's class. It is the variable- 
^1 sized nature of elements that causes their persistent storage to be complicated, 
Nj because they cannot be written in fixed sized records and arranged in tables, as is 
g typically done in relational databases, for example. 

?f Each item in a model is represented by at least one element or an aggregation of 

120 elements. For example, a structural drawing can hold the column and beam layout for a 

floor plan, which are internally represented by lines, squares and rectangles and 
H additional properties. In this example, an individual beam may be a collection of lines, 

squares and rectangle elements. The structure of the floor plan may be more complex 

and require many nested levels of elements to accurately provide a structural 
25 representation. 

Accordingly, as the complexity of the project increases, the size of the CAD files 

also increases. As a result, CAD files become very large and efficient access and 

control of these large files is important. Conventionally, there are two approaches to 

storing these large data files. 
30 In the first approach, the elements are stored and accessed as a sequential list, 

each element having a fixed header containing the element's size. Storing data in this 
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manner requires that the file be read sequentially from the beginning to the end. 
Typically, a program will read the elements from the file into memory and, at the same 
time, also store the "file position" of each element in memory. 

This approach is well suited for the common scenario where a large number of 
elements are read from the disk, while only a small minority of them are modified during 
a single editing session. In this case, modified elements can often be rewritten to the file 
by a simple seek-and-write to the appropriate file position on an element-by-element 
basis. Unfortunately, this only works for elements whose contents change, but whose 
size in bytes remains the same or becomes smaller. When elements become larger 
during an editing session, they must be deleted from their original file position and 
moved to the end of the file. This tends to leave "holes" (deleted elements occupying file 
space) in the file that can only be removed by rewriting the entire file. Further, the size 
of disk file can grow quite large, because it is not possible to remove deleted entries 
from the file without rewriting the entire file, and invalidating all in-memory element 
positions. 

The second approach is to apply a compression algorithm to the element data 
before it is written to the file. This can often result in substantial savings in the resultant 
file size, because many applications have element data typically containing a great deal 
of redundancy. However, with this approach, element data cannot be saved 
incrementally, because a change to a single element can result in an entirely different 
compressed file. 

Another consideration for the file storage approach is the typical requirement to 
allow multiple users to simultaneously access models when collaborating on a project. 
Typically, a first user creates an original model, which multiple users may view and/or 
edit depending on the level of access granted to the user. Since communication 
between users typically occurs over a computer network, the CAD system must ensure 
that changes to the model or items in the model are properly coordinated and the 
models are kept in a consistent state at all times. It is understood that a computer 
network refers to any type of computer network including but not limited to a local area 
network, wide area network (e.g. Intranet), and the Internet. The Internet includes but is 
not limited to the World Wide Web. 
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Since computer systems used in many industries (such as ECO) demand 
efficient use of network resources and have the further requirement that the file system 
must perform in a multi-user environment, existing file storage approaches require 
substantial improvement in order to enhance the efficiency of the file system to support 
the increasing data storage requirements. Therefore, there is a continuing need for an 
enhanced file storage approach, which efficiently accesses and controls large quantities 
of data in a single user and multi-user environment. Moreover, there is a continuing 
need for an enhanced file format permitting access and control to large quantities of 
data to improve the efficient transfer and storage of large quantities of data. 

Summary of the Invention 

In preferred embodiments of the present invention, such problems are generally 
avoided by use of a method in a computer system for storing element chunks of 
elements in a storage area, comprising the steps of: (1 ) storing at least one root storage 
in a storage area; (2) storing a model directory in the at least one root storage; (3) 
storing at least one model in the model directory; (4) storing a graphic element list and a 
control element list in the at least one model, the graphic element list having element 
chunks containing graphic elements and the control element list having element chunks 
containing control elements; (5) assigning a preselected number of elements to each 
element chunk; and (6) allocating each element to an element chunk in one of the 
control element list and the graphic element list. 

According to a second embodiment of the present invention, a method in a 
computer system for storing element chunks of elements in a storage area, comprises 
the steps of: (1) storing at least one root storage in a storage area; (2) storing at least 
one model directory in each root storage; (3) storing a control model in each root 
storage; (4) storing at least one model in the model directory; (5) storing a graphic 
element list and a control element list in each model and each control model directory, 
the graphic element list having element chunks containing graphic elements and the 
control element list having element chunks containing control elements; (6) assigning a 
preselected number of elements to each element chunk; and (7) allocating each 
element to an element chunk in one of the control element list and the graphic element 
list. 
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A method in a computer system for modifying at least one compressed element 
chunk in a main storage area, in accordance with a third embodiment of the present 
invention, comprises the steps of: (1) storing at least one root storage in the main 
storage area; (2) storing a model directory in the at least one root storage; (3) storing at 
5 least one model including a graphic element list and a control element list in each model 
directory; (4) storing at least one compressed element chunk in each graphic element 
list and control element list, wherein each the compressed element chunk comprises 
compressed elements; (5) storing the at least one root storage in a temporary memory; 
(6) decompressing the compressed element chunks and compressed elements in the 
10 graphic element list and the control element list; (7) modifying elements in the 
temporary memory; (8) flagging the decompressed element chunks having the modified 
elements in the temporary memory with a dirty flag; (9) compressing the flagged 
f element chunks to provide modified compressed element chunks after a predetermined 
'it idle time has passed; and (10) replacing the compressed element chunks in the main 
n15 storage area with the modified compressed element chunks. 

Jt" According to a fourth embodiment of the present invention, a method in a 

computer system for modifying at least one encrypted element chunk stored in a main 
s storage area, comprises the steps of: (1) storing at least one root storage in the main 
J;f storage area; (2) storing a model directory in each root storage; (3) storing at least one 
N20 model including a graphic element list and a control element list in each model 
□ directory; (4) storing at least one encrypted element chunk in each graphic element list 
^ and control element list, wherein each element chunk comprises compressed elements; 
(5) storing the at least one root storage in a temporary memory; (6) decrypting the 
element chunks and the elements in the graphic element list and control element list; (7) 
25 modifying elements in the temporary memory; (8) flagging the element chunks having 
the modified elements in the temporary memory with a dirty flag; (9) encrypting the 
element chunks having the modified elements in temporary memory to provide modified 
encrypted element chunks after a predetermined idle time has passed; and (10) 
replacing the encrypted element chunks in the main storage area with the modified 
30 encrypted element chunks. 
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A method in a computer system for modifying at least one compressed encrypted 
element chunk in a main storage area, in accordance with a fifth embodiment of the 
present invention, comprises the steps of: (1 ) storing at least one root storage in the 
main storage area; (2) storing a model directory in each root storage; (3) storing at least 
one model including a graphic element list and a control element list in each model 
directory; (4) storing at least one compressed and encrypted element chunk in each 
graphic element list and control element list, wherein each element chunk comprises 
compressed elements; (5) storing the at least one root storage in a temporary memory; 
(6) decrypting the element chunks and elements in the graphic element list and control 
element list; (7) decompressing the element chunks and elements in the graphic 
element list and control element list; (8) modifying elements in the temporary memory; 
(9) flagging the element chunks having the modified elements in the temporary memory 
with a dirty flag; (10) compressing and encrypting the element chunks having the 
modified elements in temporary memory to provide modified compressed encrypted 
element chunks after a predetermined idle time has passed; and (11) replacing the 
compressed encrypted element chunks in the main storage area with the modified 
compressed encrypted element chunks. 

According to a sixth embodiment of the present invention, a computer system for 
modifying at least one compressed chunk of elements stored in a main storage area, 
comprises: (a) a temporary memory; (b) a root storage stored in the main storage area 
and the temporary memory; (c) a model directory stored in the root storage; (d) at least 
one model stored in the model directory; (e) at least one graphic element list stored in 
each model; (f) at least one control element list stored in each model; (g) at least one 
compressed element chunk stored in each graphic element list and control element list, 
wherein each element chunk comprises compressed elements; (h) means for 
decompressing the element chunks and elements in the graphic element list and control 
element list in the temporary memory; (i) means for modifying elements in the 
temporary memory; (j) means for lagging the element chunks having the modified 
elements in the temporary memory; (k) means for compressing the element chunks 
having the modified elements in temporary memory to provide modified compressed 
element chunks after a predetermined idle time has passed; and (I) means for replacing 
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the compressed element chunks in the main storage area with the modified compressed 
element chunks. 

A computer system for modifying at least one encrypted element chunk stored in 
a main storage area, in accordance with a seventh embodiment of the present 
invention, comprises: (a) a temporary memory; (b) a root storage stored in the main 
storage area and the temporary memory; (c) a model directory stored in the root 
storage; (d) at least one model stored in the model directory; (e) at least one graphic 
element list stored in each model; (f) at least one control element list stored in each 
model; (g) at least one encrypted element chunk stored in each graphic element list and 
control element list, wherein each element chunk comprises encrypted elements; (h) 
means for decrypting the element chunks and elements in the graphic element list and 
control element list in the temporary memory; (i) means for modifying elements in the 
temporary memory; (j) means for flagging the element chunks having the modified 
elements in the temporary memory with a dirty flag; (k) means for encrypting the 
element chunks having the modified elements in temporary memory to provide modified 
encrypted element chunks after a predetermined idle time has passed; and (I) means 
for replacing the encrypted element chunks in the main storage area with the modified 
encrypted element chunks. 

According to an eighth embodiment of the present invention, a computer system 
for modifying at least one encrypted compressed element chunk stored in a main 
storage area, comprises: (a) a temporary memory; (b) a root storage stored in the main 
storage area and the temporary memory; (c) a model directory stored in the root 
storage; (d) at least one model stored in the model directory; (e) at least one graphic 
element list stored in each model directory; (f) at least one control element list stored in 
each model directory; (g) at least one encrypted compressed element chunk stored in 
each graphic element list and control element list, wherein each element chunk 
comprises encrypted compressed elements; (h) means for decrypting the element 
chunks and elements in the graphic element list and control element list in the 
temporary memory; (i) means for decompressing the element chunks and elements in 
the graphic element list and control element list in the temporary memory; (j) means for 
modifying elements in the temporary memory; (k) means for flagging the element 
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chunks having the modified elements in the temporary memory with dirty flags; (I) 
means for compressing and encrypting the element chunks having the modified 
elements in temporary memory to provide modified encrypted compressed element 
chunks after a predetermined idle time has passed; and (m) means for replacing the 
encrypted compressed element chunks in the main storage area with the modified 
encrypted compressed element chunks. 

Other objects, advantages, and novel features of the present invention will 
become readily apparent from the following detailed description thereof, when 
considered in conjunction with the accompanying drawings wherein: 

Brief Description of the Drawings 

Figure 1 shows a block diagram illustrating a high-level exemplary network 
environment for handling large data files; 

Figure 2 shows a high level exemplary workstation adapted to transfer, receive 
and/or store large data files; 

Figures 3-7 show the file format of the present invention; 

Figures 8A-8B show the creation and storage of element chunks in accordance 
with two embodiments of the present invention; 

Figures 9A-9B show block diagrams for automatically storing element chunks 
having modified elements in accordance with other embodiments of the present 
invention; 

Figure 10 shows a block diagram for automatically storing element chunks 
having modified elements in accordance with another embodiment of the present 
invention; 

Figures 11 -11 A show block diagram for automatically storing element chunks 
having modified elements and notifying users of storage of modified elements in a multi- 
user environment in accordance with other embodiments of the present invention; and 

Figure 12 shows a block diagram for automatically storing element chunks 
having modified elements and notifying users of storage of modified elements in a multi- 
user environment in accordance with another embodiment of the present invention. 
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Detailed Description of the Preferred Embodiments 
The present invention relates to providing an enhanced file format to store large 
quantities of variable-sized data records on a storage medium and to permit efficient 
access and control over data stored in the enhanced file format. A file format is the 
structure of a file that defines the way it is stored and laid out on a disk. In a preferred 
embodiment of the present invention, a file format includes a model directory containing 
models. Models include element lists comprising element chunks. The element chunks 
include elements, where elements are variable-sized data records. 

Referring now to the drawings, wherein like reference characters and numerals 
designate like or corresponding parts throughout each of the several views, there is 
shown in Figure 1 a diagram illustrating a high-level exemplary network environment for 
handling large data files in accordance with the file system and file format of the present 
invention. The file format of the present invention is intended to be utilized in any 
network environment such as the Internet, Intranet, local area network, etc. for any 
application, but may also be utilized on stand-alone computer workstations and/or 
processing units. By use of the term "Internet," it should be understood that the 
foregoing is not intended to limit the present invention to a network also known as the 
World Wide Web. 

The network 10 is a computer system adapted to be accessed by a plurality of 
clients. Such clients, in turn, suitably comprise one or more conventional workstations 
20, wireless computing devices 25, or other end user devices operating as a "fat" or a 
"thin" client. It should be understood, nevertheless, that wireless computing devices 25 
include computing devices such as Web-enabled hand-held devices such as, e.g., a 
Palm V™ type devices manufactured by Palm, Inc., Santa Clara, California U.S.A., 
Windows CE devices, pagers, and "smart" phones, which can use a wireless access 
protocol; Internet appliances; and other computing and end user devices which operate 
via satellite, infrared, remote sensor or other means not requiring a physical connection 
or coupling to another computing device, appliance or network, also fall within the spirit 
and scope of the present invention. 

Clients of all of the above types can suitably access the network 10 and the main 
storage area 30 through the network 10. The network 10 may be directly coupled to the 
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main storage area 30 or the network 10 may access the main storage area 30 through 
firewall 40, load balancer 50, network servers 60, load balancer 70 and application 
servers 80. The load balancer 50 and load balancer 70 may be the same type of unit. 
These load balancers 50 and 70 route traffic to and from the main storage area 30. It is 
understood that the present invention is not limited to this arrangement. However, the 
network must access a main storage area 30, which is either internal to the network 10 
or external to the network 10. The main storage area 30 is a computer readable 
medium. 

Figure 2 shows a block diagram of a workstation 20 coupled to the network 10. 
Workstation 20 includes one or more processors 100 coupled to a bus 105, The bus 
105 can be coupled to any of various subsystems including: a temporary memory 110; 
a secondary memory 112 such as, a disk 114, and/or a removable storage drive 116 
into which media 118 can be placed including, e.g., a diskette, a compact diskette (e.g. 
CD ROM) or the like; an input device such as a mouse 120, or a keyboard 125; an 
output device such as a display 130 or printer 135; and input/output (I/O) devices to a 
network 10 such as network interface card (NIC) 140 such as an Ethernet, Token Ring, 
Smart and Asynchronous Transfer Mode (ATM) cards. Other input/output devices may 
include a modem 145, or other input/output device such as, a wireless interface 150 
(e.g. a wireless transceiver). It will be apparent to those skilled in the relevant art that 
the above-described workstation 20 has been provided as an example and is not 
intended to limit the breadth of the invention in any way. In addition, it is understood that 
the wireless computing device 25 includes several of these components and/or is 
adapted to interface with a workstation 20 having these and/or similarly functioning 
components. For the purposes of the specification and claims, it is understood that the 
wireless computing device 25 is functioning as a workstation 20. Therefore, it is 
understood that any further reference to a workstation in the specification, drawings or 
claims includes both wired and wireless computing devices. 

Figure 3 shows a file format (internal structure) 200 of a file of the present 
invention, which is preferably stored in the main storage area 30 and may be stored in 
temporary memory 110 in one or more workstations 20. One or more files having the file 



-10- 



36488-174713 



format 200 are stored in the main storage area 30 and more than one file may be stored 
in temporary memory in the workstation 20. 

Within files created using the file format of the present invention, information is 
organized hierarchically; utilizing a "Storage and Stream" approach. A "Stream" is a 
variable-sized sequence of bytes that is assigned an identifier, such as a name, and can 
be saved persistently. In this respect, a Stream can be analogous to a "file" in a disk file 
system. A "Storage" can be analogous to a file system "directory"; it has an identifier 
such as a name, and can contain Streams or other Storages. 

In the present invention, the starting point for reading or creating files is the root 
storage 210. The root storage 210 contains other storages and streams including, but 
not limited to, those defined by the current invention. It should be obvious that since the 
root storage 210 is created using well known programming interfaces, other applications 
can create other storages and streams 211 within a file created by the present invention 
without corrupting the other items in the file. However, programs creating or reading 
files having the file format of the present invention may store one or more streams of a 
known format such as the following: 

- File Header stream 212 to hold information about the file itself, such as 
whether the file is compressed or encrypted, and if so, what techniques are 
employed in the compression and/or encryption; 

- Session Information stream 213 to hold information about the current editing 
session, such as the name of the current user editing the file; 

- Manifest stream 214 to hold identification information about the file, such as a 
globally unique number assigned by an authentication source, and a 
Universal Resource Locator to find the authentication source; 

- File Properties stream 215 to hold information about the file that are exposed 
to other applications using well-known interfaces such as Microsoft's 
IPropertySet interfaces. 

An important purpose of the present invention is to provide a system and method 
to store large sets of variable-sized elements. For convenience, related elements are 
grouped together in a model 310. Models 310 may group elements for any purpose 
relevant to an application, but are frequently intended to reflect some real world 
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structure. It is often desirable to store multiple models 310 in the same file. To 
accomplish this, a storage of the root storage 210 called the model directory 300 is 
created. The model directory 300 holds a list of models 310, each with a unique name 
(Figure 4). Alternatively, if only a single model per file is desired, the model 310 can be 
5 stored directly in the root storage 210 without creating a model directory 300. 

In many cases, it may be necessary to designate one model 310 as having 
special significance versus all other models in a file. For example, certain elements may 
contain resources that are shared among all models in the file. In this case a specially 
named model is stored directly in the root storage 210, called the control model 217. 
10 The control model 217 is in the same format as all other models 310 other than its 
special name and location in the root storage 210. When referring to the control model, 
the term "global" may be placed before the terms graphic element list 311, control 
0 element list 312, element chunk 320, graphic element, control element, etc. to indicate 
C; that the item refers to a resource which is shared among all of the models. In an 
rj5 alternate embodiment according to the present invention, control model 217 includes 
jjj only one element stream, having neither a graphic element stream nor a control 
! 4 element stream. 

9 The number of models 310 which may be stored in a model directory 300, is only 

w constrained by the amount of memory in the main storage area 30. Therefore, if there 
|20 was an infinite amount of memory in the main storage area 30, then there could be an 

infinite number of models 31 0 in the model directory 300. 
^ As shown in Figure 5, every model 310 contains a specially named stream 

called a model header stream 314 to hold information that is specific to the model 310. 

For example, the model header stream 314 can contain the model name, the units (e.g. 
25 metric/English), the geometric range of the model 310, etc., or any other information 

that may be relevant to know about the model 310 without actually having to load the 

model into a memory. 

Within a specific model 310, elements can be further classified according to their 

meaning in the model. Each such category of elements is stored in a sub-storage of the 
30 model 310 called an element list storage 315. For example, in CAD applications, 

elements are often classified into two types: graphic and control. Graphic elements have 
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a physical representation that occupies geometric space and can be drawn on a 
computer screen or a plotter. Control elements hold information pertinent to the model, 
but that does not, on it's own, have a physical representation. As shown in Figure 6, the 
graphic elements can be stored in a graphic element list 311 and the control elements 
5 can be stored in the control element list 312. Further categories of elements can be 
stored in additional element lists 313. 

Within an element list storage 315 (e.g., graphic element list 311 and control 
element list 312), individual elements are stored in groupings, or "element chunks" 320 
as shown in Figure 5. Each element chunk 320 is given a unique name preferably 
10 derived from its chunk number (e.g. "chunkOOl", "chunk002", etc.) so that the number of 
such chunks can be determined merely by iterating over the names in the element list 
storage 315. Alternatively, an element list header stream could be created to hold the 
^ stream names of the element chunks, but that would require the element header to be 

updated as element chunks are added and removed. 
Sj5 One purpose for dividing the elements stored in an element list storage 315 

|j among a series of element chunks 320, is to permit compression and encryption of the 
M element chunks 320 without requiring that the entire element list 315 held in the model 
be rewritten when any individual element is modified. This can be particularly important 
g in certain types of applications, such as CAD applications, where the element data 
p0 tends to be large but sparse. In this case, many well-known compression algorithms can 
}f; yield significant ratios of the original size of the data to the compressed size of the data. 
M Compressing element data can be a significant factor in the performance of the 
applications in terms of disk space, file I/O requirements, and network transfer times. 
Similarly, encryption algorithms are designed to accept original data and create 
25 encrypted data. However, in many applications such as CAD applications, small 
subsets of elements can be modified frequently and must be written to the disk often, 
both for reliability concerns and for real-time sharing of element data with other users on 
a network. In this case it would be impractical and unacceptable to have to re- 
compress, re-encrypt, and rewrite an entire element list 315 every time any element 
30 changes. 
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Several approaches can be used to determine how to sub-divide an element list 
315 into a series of element chunks 320. One approach is to fill element chunks 320 
with elements until the chunk size reaches a pre-determined limit. This will tend to make 
the pre-compressed size of each element chunk 320 approximately the same. In the 
preferred embodiment of the present invention, the size of an element chunk 320 is 
determined by choosing a fixed number of elements. This approach makes the 
maintenance of element chunks 320 straightforward. However, certain applications may 
require that certain groups of elements be considered "atomic," and be stored and 
retrieved as a single unit. In this case, the entire group should preferably be stored in a 
single element chunk 320. For example, in a CAD application where the size of an 
element varies but tends to approximate 100-200 bytes, a chunk size of 1000 elements 
has been demonstrated to work well. As should be obvious, the number of elements in 
any given element chunk 320 within an element list 315 need not be the same as all 
other element chunks 320 in the same element list 315. This can often be the case, 
either due to application grouping of elements that must all reside in the same element 
chunk 320, or due to element deletions after their original insertion into an element 
chunk 320. 

Each element chunk 320 preferably has a fixed header 325 (element chunk 
header) that can contain information necessary to read the elements from the chunk 
320 into memory. For example, the fixed header 325 should contain the number of 
elements in the element chunk 320, so that value can be known before reading the 
element data 330 from the element chunk 320. The fixed header 325 can also contain 
additional information, such as the compression and encryption scheme used for the 
element data 330 in the case where it is desirable to allow such compression and 
encryption scheme to vary from element chunk 320 to element chunk 320 within an 
element list 315. 

In each element chunk 320 immediately following the fixed header 325 at the 
offset equal to the size of the fixed header 325, is the element data 330. Depending on 
the values of the compression and encryption flags in the fixed header, the element 
data 330 can be either in "raw" (uncompressed and unencrypted) form or in 
compressed and/or encrypted form. 
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Figure 8A shows the creation and storage of element chunks in accordance with 
the present invention. The process of creating a file according to the file format of the 
present invention begins (400) by creating a root storage 210 (402). At least one model 
directory 300, having at least one model 310 is created (404), and stored in the root 
5 storage 210 (406). A file header 212, session information 213, manifest 214, file 
properties 215 are preferably created and stored in the root storage 210 as shown in 
steps 406-414 of Figure 8A. These items can be created and stored in any order in the 
root storage 210. in addition as discussed above, other applications can create other 
storages and streams 211 within a file created by the present invention without 
10 corrupting the other items in the file and these storages and stream 211 are also stored 
in the root storage 210, which is shown in steps 416 and 406 in Figure 8A. 

As shown in Figure 8B, each model 310 preferably contains at least one element 
f , F list (452). (However, each model 310 is not required to contain an element list 315.) For 
C example, each model 310 may contain a graphic element list 311 and a control element 
rj5 list 312. Each element chunk 320 contains a predetermined or preselected number of 
|( elements, wherein the actual number of elements in each element chunk 320 does not 
H have to be the same as discussed previously (454). The preselected number can be a 
g maximum number of elements. As each element, associated with a file format 200, is 
j;j created, each element is allocated to an element chunk 320 in an element list 315 
|20 (456). Subsequently, the element chunk 320 is preferably compressed (458, 460). 
p However, compression is not necessary (458). Each compressed element chunk 320 
H may also be encrypted (462, 464). However, encryption is not necessary (462). 
Subsequently, the element chunk 320 is stored in the storage area (466). The element 
chunk 320 stored in the storage area may or may not be encrypted, compressed or both 
25 encrypted and compressed (466). If additional elements require storage (468), then 
steps 450-468 are repeated until all of the elements are allocated to element chunks 
320 and the element chunks 320 are stored in the storage area. 

Figure 9A shows a block diagram for automatically storing element chunks 320 
having modified elements in accordance with another embodiment of the present 
30 invention (steps 500-518). The process begins (500) by accessing a file having a file 
format 200 of the present invention in a storage area such as main storage area 30, and 
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storing the file in temporary memory (503, 504). The temporary memory is preferably 
located in a workstation 20. A user, who is granted permission to edit the contents of the 
file in temporary memory, modifies the elements in the temporary memory (506). Each 
time a user modifies an element, a "dirty" flag is set to indicate the element chunk 320, 
containing the element that has been modified. If there is a period of idle use during 
which time the user has not made additional changes to the elements (508), the 
computer network 10 determines whether any dirty flags have been set. If a modification 
has been made, the computer network 10 identifies all of the element chunks 320 
containing modified elements (512). The modified element chunks 320 containing the 
modified elements are compressed (514) and stored in the main storage area 30 by 
overwriting the unmodified element chunk 320 in the storage area 30 (516-518). 

Alternatively, Figure 9B shows a block diagram for automatically storing element 
chunks 320 having modified elements in accordance with another embodiment of the 
present invention (steps 500-520). The process begins (500) by accessing a file having 
a file format 200 of the present invention in a storage area such as main storage area 
30, and storing the file in temporary memory (503, 504). The temporary memory is 
preferably located in a workstation 20. A user, who is granted permission to edit the 
contents of the file in temporary memory, modifies the elements in the temporary 
memory (506). Each time a user modifies an element, a "dirty" flag is set to indicate the 
element chunk 320, containing the element that has been modified. If there is a period 
of idle use during which time the user has not made additional changes to the elements 
(508), the computer network 10 determines whether any dirty flags have been set. If a 
modification has been made, the computer network 10 identifies all of the element 
chunks 320 containing modified elements (512). The modified element chunks 320 
containing the modified elements are encrypted (513), and stored in the main storage 
area 30 by overwriting the unmodified element chunk 320 in the storage area 30 (519- 
520). 

Alternatively, Figure 10 shows a block diagram for storing element chunks 320 
having modified elements in accordance with another embodiment of the present 
invention (steps 500-518). In this embodiment, the element data 330 are encrypted and 
compressed in the main storage area 30. Therefore, in order to access a file from the 
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main storage area 30 and store the file in the temporary memory, the element data 330 
must be decrypted and decompressed (503). The compression operation preferably 
precedes the encryption operation. However, these operations may be reversed (514- 
515). 

5 Figure 11A shows a block diagram for automatically storing streams having 

modified elements in accordance with another embodiment of the present invention. In 
this embodiment, at least one element in an element chunk 320 is modified on a first 
workstation 20 (600,602) by a first user, and each modified element chunk 320 is 
incrementally saved during periods of idle use (604). A second user at a second 
10 workstation 20 may have been given permission to view or edit the element chunks 320 
containing modified elements. If the second user is on a list of individuals who are 
notified of updates to files, then a signal may be sent from the computer network 10 to 
K the second workstation 20 to advise the second user that the file has been updated 
%i (606, 608, 610). If the second user is not automatically advised of changes by the 
rt|5 computer network 10, then the second user must periodically check for updates to the 
f. file (606). 

y Regardless of how the second user is notified of modifications of elements in 

g element chunks 320 in models 315 of model directories 310 in a file, the second user 
}*{ requests, by way of the second workstation, the modified element chunks 320 (612). 
110 The modified element chunks are accessed based on the request from the second 
jn workstation (614) and the modified element chunks are decompressed (616) by the 
^ second workstation. The modified elements are viewed on the second workstation (618, 
620). Alternatively, if the modified element chunks 320 are encrypted instead of 
compressed, the modified element chunks are decrypted (615) by the second 
25 workstation as shown in Figure 11B, so that they be viewed on the second workstation 
(618,620). 

Figure 12 shows a block diagram for automatically storing element chunks 320 
having modified elements in accordance with another embodiment of the present 
invention. In this embodiment, at least one element is modified on a first workstation 20 
30 (600, 603) by a first user, and the modified elements are incrementally saved during 
periods of idle use (605). A second user at a second workstation 20 may have been 
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given permission to view or edit element chunks 320 containing modified elements. If 
the second user is on a list of individuals who are notified of updates to files, then a 
signal may be sent from the computer network 10 to the second workstation 20 to 
advise the second user that the file has been updated (606, 608, 610). If the second 
user is not automatically advised of changes by the computer network 10, then the 
second user must periodically check for updates to the file (606). 

Regardless of how the second user is notified of modifications to elements in 
element chunks 320 of element lists 310, the second user requests by way of the 
second workstation 20 the modified element chunks 320 (612). The modified element 
chunks 320 are accessed based on the request from the second workstation 20 (613) 
and the modified element chunks 20 are decrypted (615) and/or decompressed (616) by 
the second workstation 20. The modified streams are viewed on the second workstation 
20 (618, 620). 

While the invention has been described in detail with reference to specific and 
preferred embodiments, it will be appreciated that various modifications and variations 
will be apparent to the artisan. All such modifications and embodiments as may occur to 
one skilled in the art are intended to be within the scope of the appended claims. 
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